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HUMAN SPASMOLYTIC POLYPEPTIDE IN GLYCOSYLATED FORM 



FIELD OF INVENTION 



The present invention relates to human spasmolytic polypeptide 
in glycosylated form, variants of human and porcine spasmolytic 
5 polypeptides and a method of producing spasmolytic polypeptides 
in glycosylated form. 

BACKGROUND OF THE INVENTION 

Human spasmolytic polypeptide (HSP) belongs to a family of 
peptides containing one or more characteristic trefoil domains 

10 [1]. The trefoil domain is made up of a sequence of 38 or 39 
amino acid residues in which 6 cystein residues are linked in 
the configuration 1-5 , 2-4 and 3-6 thus forming a 
characteristic trefoil structure [1]. The trefoil family of 
peptides consists of rat intestinal trefoil factor, ITF [2], 

15 human breast cancer associated peptide, pS2 [3,4,5], porcine, 
human and murine spasmolytic polypeptide (PSP, HSP, MSP) 
[6,7,8] and frog spasmolysins (xPl, xP2 and xP4) [8,10,11] all 
containing 1, 2 or 4 trefoil domains (Fig. 1) . 

The physiological function of the trefoil peptides is poorly 
20 understood, and so far only PSP has been studied in any detail. 
In the porcine pancreas, PSP is found in the acinar cells and 
to be secreted in large amounts (50-100 mg/ml) into the 
pancreatic juice upon stimulation with pancreozymin or secretin 
[12,13,14]. PSP is resistant to digestion by intestinal 
25 proteases in the gastrointestinal tract [12], and specific 
binding of PSP to rat intestinal mucosa cells and membrane 
preparations from these cells has been demonstrated [15,16]. In 
the porcine gastrointestinal tract, specific receptor-like 
binding to Paneth cells in the duodenum has been found [17]. 
3 0 These results suggest a unique intraluminal function of the 
peptide. A pharmacological screening has indicated that PSP has 
spasmolytic and gastric acid secretion inhibitory effects [18], 
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and studies on mammalian cells have indicated a growth factor- 
like activity of PSP [19]. 

The DNA sequence and derived amino acid sequence of the human 
counterpart of porcine SP is shown in [8]. Unlike PSP, human SP 
5 (Fig, 2), has been found to be expressed .in the stomach, but 
not in the pancreas to any greater extent [ 8 ] . An increased 
expression of HSP and pS2 has been reported to be associated 
with peptic ulcers and mucosal injury in inflammatory bowel 
disease [20,21] indicating a possible healing function of these 
10 peptides. 

Only very limited amounts of HSP can be prepared by extraction 
of human tissue. An object of study resulting in the present 
invention was therefore to prepare recombinant HSP in 
sufficient amounts for physiological and biochemical studies of 
15 the peptide. 

SUMMARY OF THE INVENTION 

It has surprisingly been found that when recombinant HSP is 
produced in certain host organisms, a proportion of it is 
produced in glycosylated form by posttranslational 
20 modifications. The glycosylated form of HSP has not, to 
applicant's best knowledge, been described previously. 

Accordingly, the present invention relates to human spasmolytic 
polypeptide (HSP) which has the amino acid sequence 

Glu Lys Pro Ser Pro Cys Gin Cys Ser Arg Leu Ser Pro His Asn Arg 
25 Thr Asn Cys Gly Phe Pro Gly lie Thr Ser Asp Gin Cys Phe Asp Asn 
Gly Cys Cys Phe Asp Ser Ser Val Thr Gly Val Pro Trp Cys Phe His 
Pro Leu Pro Lys Gin Glu Ser Asp Gin Cys Val Met Glu Val Ser Asp 
Arg Arg Asn Cys Gly Tyr Pro Gly lie Ser Pro Glu Glu Cys Ala Ser 
Arg Lys Cys Cys Phe Ser Asn Phe lie Phe Glu Val Pro Trp Cys Phe 
30 Phe Pro Asn Ser Val Glu Asp Cys His Tyr (SEQ ID N0:1) 
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or a functionally equivalent homologue thereof, characterized 
by being in glycosylated form. 

In the present context, the term "functionally equivalent" is 
intended to indicate that the homologous polypeptide has a 
5 biological activity (e.g. spasmolytic effect) corresponding to 
that of native HSP. The term "homologue" is intended to 
indicate a polypeptide encoded by DNA which hybridizes to the 
same probe as the DNA coding for HSP under comditions of high 
or low stringency (e.g. as described in Sambrook et. al., 

10 Molecular Cloning: A Laboratory Manual , 2nd Ed. , Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY, 1989) . More 
specifically/ the term is intended to refer to a DNA sequence 
which is at least 60% homologous to the sequence encoding HSP 
with the amino acid sequence shown above. The term is intended 

15 to include modifications of the DNA sequence such as nucleotide 
substitutions which do not give rise to another amino acid 
sequence of the polypeptide, but which correspond to the codon 
usage of the host organism into which the DNA construct is 
introduced or nucleotide substitutions which do give rise to a 

20 different amino acid sequence and therefore, possibly, a 
different protein structure which might give rise to a mutant 
polupeptide with different properties than the native enzyme. 
Other examples of possible modifications are insertion of one 
or more codons into the sequence, addition of one or more 

25 codons at either end of the sequence, or deletion of one or 
more codons at either end or within the sequence. The term 
"glycosylated" is intended to indicate that a carbohydrate 
moiety is present at one or more sites of the protein molecule. 

It is at present- contemplated that glycosylation of HSP may 
3 0 give rise to differences in the biological activity of the 
protein, for instance with respect to stability towards 
proteolytic enzymes in the gastrointestinal tract, solubility 
at gastric and/or intestinal pH compared to non-glycosylated 
HSP, antigenicity, half-life, tertiary structure, and targeting 
35 to receptors on appropriate cells. 
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In another aspect, the present invention relates to a variant 
of a spasmolytic polypeptide (SP) which is a fragment of human 
spasmolytic polypeptide (HSP) or porcine spasmolytic 
polypeptide (PSP) comprising at least one trefoil domain. 

5 The variant SP may be provided in both glycosylated and non- 
glycosylated form* It is at present contemplated that such a 
variant may be advantageous to use instead of full-length SP 
because of a higher specific biological activity, increased 
solubility and stability, longer half -life, easier way of 
10 production, or the like* 

It is assumed that other spasmolytic polypeptides than HSP 
will, if provided with a glycosylation site, also be expressed 
in predominantly glycosylated form. In a further aspect, the 
presfent invention therefore relates to a method of preparing a 

15 spasmolytic polypeptide in at least 60% glycosylated form, 
wherein a host cell transformed with a DNA fragment encoding a 
spasmolytic polypeptide and capable of providing glycosylation 
of said spasmolytic polypeptide is cultured under conditions 
permitting production of said spasmolytic polypeptide and 

20 recovering the resulting spasmolytic polypeptide from the 
culture . 

DETAILED DESCRIPTION OF THE INVENTION 

It has been found that, at least when recombinant HSP is 
produced in yeast, the proportion of it that is provided in 

25 glycosylated form is in N-glycosylated form. It has further 
been found that glycosylation takes place at AsnlS of the 
sequence shown above. In preferred embodiments of glycosylated 
HSP, the glycosylated side chain contains at least one hexose 
unit. In particular, the glycosylated side chain may contain at 

3 0 least one mannose unit, preferably at least five mannose units, 
most preferably at least ten mannose units. In one preferred 
embodiment of glycosylated HSP of the invention, the 
glycosylated side chain contains 13-17 mannose units. In other 
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preferred embodiments, the glycosylated HSP is in addition 
glycosylated with at least one unit of N-acetyl glucosamine 
(GlcNAc) . In the currently preferred embodiment, the 
glycosylated HSP is glycosylated at AsnlS with (GlcNAc) 2 (Man) 10 . 

It is further contemplated to produce homologues of HSP which 
are provided with one or more additional glycosylation sites. 
Thus, the present invention also relates to HSP homologues, 
wherein Lys2 is replaced by Asn, Gln7 is replaced by Asn, ArglO 

10 is replaced by Asn, Gly 20 is replaced by Thr or Ser, Gly23 is 
replaced by Asn, lie 24 is replaced by Asn, Phe 36 is replaced 
by Asn, Asp 37 is replaced by Asn, Ser39' is replaced by Asn, 
Gln53 is replaced by Asn, Glu61 is replaced by Asn, Asp64 is 
replaced by Asn, Arg66 is replaced by Thr or Ser, Gly69 is 

15 replaced by Thr or Ser, Gly72 is replaced by Asn, lie 89 is 
replaced by Thr or Ser, Pro98 is replaced by Asn or VallOl is 
replaced by Thr or Ser, or a combination of two or more of 
these substitutions. In a currently preferred embodiment of 
such an HSP homologue, Asp 6 4 is replaced by Asn, and Arg66 is 

20 replaced by Thr or Ser, 

It is of course understood that HSP homologues of the invention 
may be glycosylated in the same manner at one or more of these 
sites as described above for glycosylation at Asnl5. 

It is assumed that the trefoil structure common among 
25 spasmolytic polypeptides is important for the function of HSP 

and PSP. The variant human or porcine SP comprising a fragment 

of the full-length polypeptide should therefore include at 

least three disulfide bonds to provide this structure. 

Consequently, the variant may comprise at least a sequence of 
30 amino acids from position 8 to 46 or from position 58 to 95, 

each of which sequences defines a trefoil domain of HSP and 

PSP. 
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As indicated above, the SP variant of the invention may be 
provided in non-glycosylated form. This may, for instance, be 
accomplished by substituting AsnlS by another amino acid, e.g. 
Asp or Glu, or by substituting Thrl7 by another amino acid 
5 except Ser, e.g. Ala. It is more likely, however, that one or 
more additional glycosylation sites will be introduced into 
this domain, for instance by replacing ArglO by Asn, Gly 20 by 
Thr or Ser, Gly23 by Asn, lie 24 by Asn, Phe 36 by Asn, Asp 37 
by Asn, or Ser39 by Asn, or a combination of two or more of 
10 these substitutions. 

On the other hand, it may be desirable to provide the trefoil 
domain from position 58 to 95 with a glycosylation site lacking 
in this domain in native HSP and PSP. Thus, Glu61 may be 
replaced by Asn, Asp64 by Asn, Arg66 by Thr or Ser, Gly 69 by 
15 Thr or Ser, or Gly72 is replaced by Asn, or a combination of 
two or more of these substitutions. In a currently preferred 
embodiment of the variant, Asp64 is replaced by Asn, and Arg66 
is replaced by Thr or Ser. 

It is of course understood that variants of the invention may 
2 0 be glycosylated in the same manner at one or more of these 
sites as described above for glycosylation at AsnlS in full- 
length HSP. 

A DNA sequence encoding HSP may suitably be isolated from a 
human genomic DNA library by PGR (polymerase chain reaction) 

25 cloning using primers based on the published cDNA sequence [8]. 
Alternatively, the DNA sequence may be prepared synthetically 
by established standard methods, e.g. the phosphoamidite method 
described by S.L. Beaucage and M.H. Caruthers, Tetrahedron 
Letters 22 , 1981, pp. 1859-1869, or the method described by 

30 Matthes et al., EMBO Journal 3 . 1984, pp. 801-805. According to 
the phosphoamidite method, oligonucleotides are synthesized, 
e.g. in an automatic DNA synthesizer, purified, annealed, 
ligated and cloned in suitable vectors. The cDNA sequence shown 
in [8] may be used as the basis of oligonucleotide synthesis. 
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Alternatively, it is possible to use cDNA coding for HSP 
obtained by screening a human cDNA library with oligonucleotide 
probes in accordance with well-known procedures. 

Furthermore, the DNA sequence may be of mixed synthetic and 
5 genomic, mixed synthetic and cDNA or mixed genomic and cDNA 
origin prepared by ligating fragments of genomic, synthetic or 
cDNA origin (as appropriate) , the fragments corresponding to 
various parts of the entire DNA sequence, in accordance with 
standard techniques. 

10 The SP variant of the invention may be encoded by a fragment 
of the full-length DNA sequence, prepared by one of the methods 
indicated above, or by suitably truncating the full-length 
sequence. 

The DNA sequence encoding HSP or an SP variant of the invention 
15 may then be inserted in a suitable expression vector. The 
recombinant expression vector may be any vector which may 
conveniently be subjected to recombinant DNA procedures, and 
the choice of vector will often depend on the host cell into 
which it is to be introduced. Thus, the vector may be an 
20 autonomously replicating vector, i.e. a vector which exists as 
an extrachromosomal entity, the replication of which is 
independent of chromosomal replication, e.g. a plasmid. 
Alternatively, the vector may be one which, when introduced 
into a host cell, is integrated into the host cell genome and 
25 replicated together with the chromosome (s) into which it has 
been integrated. 

In the vector, the DNA sequence encoding HSP or an SP variant 
of the invention should be operably connected to a suitable 
promoter sequence. The promoter may be any DNA sequence which 
3 0 shows transcriptional activity in the host cell of choice and 
may be derived from genes encoding proteins either homologous 
or heterologous to the host cell. Examples of suitable 
promoters for directing the transcription of the DNA encoding 
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the inhibitor of the invention in mammalian cells are the SV 4 0 
promoter (Subramani et al . , Mol. Cell Biol. JL, 1981, pp. 854- 
864), the MT-1 (metallothionein gene) promoter (Palmiter et 
al., Science 2 22 , 1983, pp. 809-814) or the adenovirus 2 major 
5 late promoter. Suitable promoters for use in yeast host cells 
include promoters from yeast glycolytic genes (Hitzeman et al . , 
J. Biol. Chem. 255 , 1980, pp. 12073-12080; Alber and Kawasaki, 
J. Mol. Appl. Gen. 1, 1982, pp. 419-434) or alcohol 
dehydrogenase genes (Young et al. , in Genetic Engineering of 

10 Microorganisms for Chemicals (Hollaender et al, eds.). Plenum 
Press, New York, 1982), or the TPI1 (US 4, 599, 311) or ADH2-4c 
(Russell et al., Nature 304 , 1983, pp. 652-654) promoters. 
Suitable promoters for use in filamentous fungus host cells 
are, for instance, the ADH3 promoter (McKnight et al., The EMBO 

15 4., 1985, pp. 2093-2099) or the tpi A promoter. 

The "DNA sequence encoding HSP or an SP variant may also be 
operably connected to a suitable terminator, such as the human 
growth hormone terminator (Palmiter et al., op. cit. ) or (for 
fungal hosts) the TPI1 (Alber and Kawasaki, op. cit. ) or ADH3 
20 (McKnight et al. , op. cit. ) promoters. The vector may further 
comprise elements such as polyadenylation signals (e.g. from SV 
40 or the adenovirus 5 Elb region) , transcriptional enhancer 
sequences (e.g. the SV 40 enhancer) and translational enhancer 
sequences (e.g. the ones encoding adenovirus VA RNAs) . 

25 The recombinant expression vector may further comprise a DNA 
sequence enabling the vector to replicate in the host cell in 
question. An examples of such a sequence (when the host cell is 
a mammalian cell) is the SV 4 0 origin of replication, or (when 
the host cell is a yeast cell) the yeast plasmid 2fi replication 

3 0 genes REP 1-3 and origin of replication. The vector may also 
comprise a selectable marker, e,g. a gene the product of which 
complements a defect in the host cell, such as the gene coding 
for dihydrof olate reductase (DHFR) or one which confers 
resistance to a drug, e.g. neomycin, hygromycin or 
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Alternatively, fungal cells may be used as host cells. Examples 
of suitable fungal cells are cells of filamentous fungi, e.g. 
Aspergillus spp. or Neurospora spp. , in particular strains of 
Aspergillus orvzae or Aspergillus niger . The use of ' Aspergillus 
5 spp. for the expression of proteins is described in, e.g., EP 
238 023. 

According to the present method, yeast cells are currently 
preferred for producing HSP and other SPs (such as those shown 
in Fig* 1) , as they have surprisingly been found to produce SP 
10 in a high yield and in at least 60% glycosylated form. For 
instance, about two thirds of the HSP produced by yeast may be 
recovered in glycosylated form. 

The medium used to cultivate the cells may be any conventional 
medium suitable for growing mammalian cells or fungal 

15 (including yeast) cells, depending on the choice of host cell. 
The spasmolytic polypeptide will be secreted by the host cells 
to the growth medium and may be recovered therefrom by 
conventional procedures including separating the cells from the 
medium by centrifugation or filtration, precipitating the 

20 proteinaceous components of the supernatant or filtrate by 
means of a salt/ e.g. ammonium sulfate, purification by a 
variety of chromatographic procedures, e.g. ion exchange 
chromatography or affinity chromatography, or the like. 

The present invention also relates to a pharmaceutical 
25 composition comprising HSP or a variant spasmolytic polypeptide 
of the invention together with a pharmaceutically acceptable 
carrier or excipient. In the composition of the invention, the 
variant may be formulated by any of the established methods of 
formulating pharmaceutical compositions, e.g. as described in 
3 0 Remington's Pharmaceutical Sciences . 1985. The composition may 
typically be in a form suited for oral or rectal administration 
and may, as such, be formulated as tablets or suppositories. 
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HSP or an SP variant of the invention is contemplated to be 
useful for the prophylaxis or treatment of gastrointestinal 
disorders. More specifically, it is contemplated for the 
treatment of gastric or peptic ulcers, inflammatory bowel 
5 disease, Crohn's disease or injury to the intestinal tract 
caused by radiation therapy, bacterial or other infections, 
etc. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is further illustrated in the following example 
10 with reference td the appended drawings in which 

Fig. 1 shows the trefoil family of peptides. Intestinal trefoil 
factor (ITF) contains one trefoil domain [2], as does the 
breast cancer associated pS2 peptide [3,4]. The spasmolytic 
polypeptides from man, pig and mouse contain two trefoil 
15 domains [1,8]. Spasmolysins from Xenopus laevis contain one or 
four trefoil domains [10]. Recently, a member of the frog 
trefoil family containing two domains has been described [11]. 

Fig. 2 shows the proposed structure of human spasmolytic 
polypeptide, HSP. The primary amino acid sequence is taken from 
20 Tomasetto et al. [8], and \he disulphide bonds are placed in 
homology to PSP [1]. \ 

Fig. 3 shows the nucleotide sequence (SEQ ID NO: 2) and 
corresponding amino acid sequence (SEQ ID NO: 3) of the 563 bp 
EcoRI - Xbal fragment encoding the leader - HSP fusion protein. 

25 The Kex 2 processing site is indicated by a vertical arrow. The 
leader and the PCR cloned parts of the HSP gene are shown in 
capital letters, while the synthetic parts are shown i small 
letters. The underlined sequences correspond to the PCR primers 
with horizontal arrows indicating the direction. Restriction 

3 0 sites relevant for the construction are shown. 
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Fig, 4 shows the S. cerevisiae plasmid for the expression and 
secretion of HSP. TPI-prom. and TPI-term. are S. cerevisiae 
triosephosphate isomerase transcription promoter and terminator 
sequences, respectively. POT is a selective marker, the 
5 Schizosaccharomvces pombe triosephosphate isomerase gene. Only 
restriction sites relevant for the construction of the plasmid 
have been indicated. 

Fig. 5 shows reversed-phase HPLC on a Vydac 214TP54 column of 
yeast fermentation broth. The two peaks corresponding to r-HSP 
10 and glycosylated r-HSP are indicated. The dashed line shows the 
concentration of acetonitrile in the eluting solvent. 

Fig. 6 shows ion exchance chromatography on a Fast Flow S 
column of concentrated yeast supernatant. The amount of r-HSP 
and glycosylated r-HSP were determined by the use of the HPLC 
15 system shown in Fig. 5. The bars indicate the fractions pooled 
for further purification of r-HSP and glycosylated r-HSP. The 
dashed line shows the concentration of NaCl in the eluting 
solvent. For details, see Material and Methods. 

Fig. 7 shows the final purification of r-HSP (A) and 
20 glycosylated r-HSP (B) on a preparative reversed-phase HPLC 
Vydac 214TP1022 column. The bars indicate the fractions pooled 
for lyophilization. The dashed lines show the concentration of 
acetonitrile in the eluting solvent. For details, see Material 
and Methods. 

25 Fig. 8 shows reversed-phase HPLC on a Vydac 214TP54 column of 
purified, glycosylated r-HSP (A) and r-HSP (B) . The dashed 
lines show the concentration of acetonitrile in the eluting 
solvent . 

Fig. 9 shows mass spectra of purified r-HSP (A and B) and 
30 glycosylated r-HSP (C and D) . Fig. A and Fig. C show the 
original mass spectrum of r-HSP and glycosylated r-HSP, 
respectively. Fig. B and Fig. D show the reconstructed mass 
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spectrum for r-HSP and glycosylated r-HSP on the basis of Fig. 
A and Fig. C. 



EXAMPLE 



MATERIAL AND METHODS 
5 General methods 




Standard DNA techniques were used as previously described [29]. 
Synthetic oligonucleotides were prepared on an automatic DNA 
synthesizer (380B, Applied Biosystems) using commercially 
available reagents. DNA sequence determinations were performed 
10 by the dideoxy chain-termination technique [30]. Polymerase 
chain reactions (PCR) were performed on a DNA Thermal Cycler 
(Perkin Elmer Cetus) using a commercial kit (GeneAmp, Perkin 
Elmer Cetus) . 

PCR cloning of HSP 

15 The first trefoil domain of HSP was isolated by a PCR reaction 
in which 1 fig human genomic DNA (Clontech, Palo Alto, CA, USA) 
was used as a template. The reaction mixture contained 100 
pmole each of the forward primer 1 (GGCTGAGCCCCCATAACAG) (SEQ 
ID NO: 4) and reverse primer 2 (TGGAAACACCAGGGGAC) (SEQ ID NO: 5) 

20 (Fig. 3) and was carried out in a 100 /xl volume. The cycle was 
: 94°C for l min, 50°C for l min, and 72°C for l min. After 3 0 
cycles a final cycle was performed in which the 72°C step was 
maintained for 10 min. The PCR product, a 115 bp fragment, was 
isolated by electrophoresis on a 2% agarose gel. 

25 The 115 bp PCR fragment was digested with Ddel and then ligated 
to a 31 bp\ duplex formed from the oligonucleotides 
(GAGAAACCCTCCCCCTGCCAGTGCTCCAGGC) (SEQ ID NO: 6) and 
(TCAGCCTGGAGCACTGGC^GGGGGAGGGTTTCTC) . The ligation product was 
amplified by \cr using forward primer 3 
30 (GCTGAGAGATTGGAGAAGAGAGAGAAACCCTCCCCCT) (SEQ ID NO: 7) and 
reverse primer 2. The 3 '\part of primer 3 is identical to the 
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N-terminal Encoding part of the HSP gene and the 5 • part of 
primer 3 is identical to the C-terminal encoding part of the 
hybrid leader gene (Fig. 3). In-frame fusion of the hybrid 
leader gene and the first trefoil domain from HSP was obtained 
by overlay extension PGR [31] . The product was digested with 
EcoRI and Avail and isolated as a 360 bp DNA fragment* 

The second trefoil domain of HSP was PCR-cloned from human 
genoroic\DNA as described for the first domain by replacing 
/ primers I\and 2 with forward primer 4 (TGCGTCATGGAGGTCTC) (SEQ 
10 ID NO: 8) and reverse primer 5 (AGCACCATGGCACTTCAAAG) (SEQ ID 
NO: 9) (Fig A 3) . Reverse primer 5 introduces a Ncol site as a 
silent mutation. The PCR product, a 115 bp fragment, was 
isolated and . cb^gested with Ddel and Ncol resulting in a 91 bp 
fragment. To thi^s fragment were ligated two synthetic duplexes. 
15 The first, encooang the amino acid sequence between the two 
trefoil domains\ consisted of the oligonucleotides 
( GTCCCCTGGTGTTTCCACGCCCTCCCAAAGCAAGAGTCGGATCAGTGCGTCATGGAGGTC ) 
(SEQ ID NO: 10) ahd (TGAGACCTCCATGACGCACTGATCCGACTCTTGCT- 
TTGGGAGGGGGTGGAAACACCAGGG) (SEQ ID NO:ll). The second, a 46 bp 
20 Ncol - Xbal fragment ^encoding the C-terminal part of HSP, 
consisted of the oligonucleotides ( CATGGTGCTTCTTCCCGAACTCTGT- 
GGAAGACTGCCATTACTAAGT ) (SEQ ID NO:12) and 

( CTAGACTTAGTAATGGCAGTCTTCCA^AGAGTTCGGGAAGAAGCAC ) (SEQ ID 



NO: 13). After Avail digestion a 195 bp Avail - Xbal fragment 
25 was isolated. * 

A DNA construct encoding the hybrid leader fused in-frame to 
the entire HSP gene was obtained by ligation of the 360 bp 
EcoRI - Avail fragment and the 195 bp Avail - Xbal fragment 
described above to the 2.7 kb EcoRI - Xbal fragment from vector 
30 pTZ19R [32]. This construct was then transformed into E. coli 
strain MT-172 (r", m*) by selection for resistance to 
ampicillin. DNA sequencing of the resulting plasmid, KFN-1843, 
showed that the correct construction had been obtained. 
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Construction of the HSP secreting yeast strain 

Plasmid KFN-1843 described above was digested with EcoRI and 
Xbal. The resulting 558 bp fragment was isolated and ligated to 
the 9.3 kb Ncol - Xbal fragment and the 1.6 kb Ncol - EcoRI 
5 fragment both from the yeast expression vector pMT-63 6. Plasmid 
pMT-63 6 is derived from the S. cerevisiae - E. coli shuttle 
vector CPOT [25,33] by deletion of the 0.4 kb Hpal - Nrul 
fragment from the Leu-2 gene. The ligation mixture was trans- 
formed into E. coli strain MT-172, and the HSP expression 
10 plasmid, KFN-1847, was isolated (Fig. 4). Plasmid pKFN-1847 was 
transformed into S. cerevisiae strain MT-663 by selection for 
growth on glucose as the sole carbon source. One transf ormant , 
KFN-1852, was selected for fermentation. 

Fermentation 

15 The transf ormant described above was cultivated at 30 *C for 3 
days in yeast peptone dextrose (YPD) medium [40] supplied with 
additional yeast extract (60 g/1) . An OD 650 nm value of 52 was 
reached at the end of the fermentation. 

Purification of r-HSP 

20 The concentration of r-HSP in the yeast fermentation broth and 
fractions obtained during the purification was measured by 
analytical HPLC. Aliquots (usually 50-200 jil) were injected 
onto a Vydac 214TP54 reverse-phase C4 HPLC column (0.46 x 25 
cm) equilibrated at 30° C at a flow rate of 1.5 ml/min with 0.1% 

25 (v/v) TFA in 5% (v/v) acetonitrile. The concentration of 
acetonitrile in the eluting solvent was raised to 65% (v/v) 
over 3 0 min. Absorbance was measured at 28 0 nm. The peaks 
eluting at 15.6 min. and 16.1 min. (Fig. 5) was found by mass 
spectrometry analysis to represent glycosylated r-HSP and 

3 0 unglycosylated r-HSP, respectively. The peptides were 
quantified using a calibrated PSP sample as standard as both 
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peptides contain two Trp and two Tyr out of 106 amino acid 
residues. 

From a 10 litre fermentor, 8 litres of fermentation broth was 
isolated by centrifugation at 3,000 rpm for 10 min. The 
5 supernatant was concentrated to 0.9 litre using an Amicon 
ultrafiltration unit (RA 2000) equipped with an Amicon spiral 
ultrafiltration cartridge type S1Y3, MW cutoff 3,000 (Product 
No, 540620). The pH was adjusted to 1.7 and the conductivity in 
the resulting concentrated sample was measured to 4.7 mS. 

10 The sample was pumped onto a Fast Flow S-Sepharose (Pharmacia) 
column (5 x 11 cm) with a flow rate of 4 0 ml/h. Previous to the 
application , the column was equilibrated in 50 mM formic acid 
buffer, pH = 3.7. After application of the sample, the column 
was washed with 500 ml of 50 mM formic acid buffer, pH = 3.7. 

15 The peptides were eluted from the column by a linear gradient 
between 1.5 litres of 50 mM formic acid buffer, pH = 3.7 and 
1.5 litres of 50 mM formic acid buffer, pH = 3.7 containing 0.6 
M NaCl. Fractions of 10 ml was collected at a flow rate of 40 
ml/h and the absorbance was measured at 280 nm. Fractions were 

20 assayed for the content of r-HSP and glycosylated r-HSP in the 
HPLC-system previously described. The elution profile is shown 
in Fig. 6. Fractions corresponding to r-HSP (fract. Nos. 107- 
128) and glycosylated r-HSP (fract. Nos. 78-95), respectively, 
were pooled. 

25 Glycosylated r-HSP and r-HSP were further purified by 
preparative HPLC chromatography. Pooled fractions (approx. 200 
ml) were pumped onto a Vydac 214TP1022 C4 column (2.2 x 25 cm) 
equilibrated in 0.1% (v/v) TFA. The column was washed with 100 
ml of 0.1% (v/v) TFA in 10% (v/v) MeCN. The peptides were 

3 0 eluted at 25 D C and at a flow rate of 5 ml/min with a linear 
gradient (650 ml) formed from MeCN/H 2 0/TFA (10 . 0 : 89 . 9 : 0. 1 v/v/v) 
and MeCN/H 2 0/TFA (60.0:39.9:0.1 v/v/v). UV-absorption was 
monitored at 280 nm, and fractions corresponding to 10 ml were 
collected and analysed for the content of r-HSP or glycosylated 
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r-HSP. Fig. 7 shows the preparative HPLC purification of r-HSP 
(Fig. 7A) and glycosylated r-HSP (Fig. 7B) . Fractions cor- 
responding to the bars were pooled, and the volume reduced to 
3 0% by vacuum centrifugation. From the two resulting pools, r- 
5 HSP and glycosylated r-HSP were isolated by lyophilization. 

Characterization of r-HSP and glycosylated r-HSP 

Amino acid composition analysis were carried out by hydrolysis 
of 50 /xg peptide with 6M HC1 for 24 h at 110 ft C as previously 
described [6]; no correction for loss during hydrolysis was 

10 carried out. Amino acid sequence analysis was determined by 
automated Edman degradation using an Applied Biosystems Model 
470A gas-phase sequencer [22]. Carbohydrate composition 
analysis was carried out by hydrolysis of 50 /ig peptide with 2M 
HC1 for lh, 2h and 4h at 100 *C and monosaccharides were 

15 separated on a CarboPac PAI (Dionex, Sunnyvale, CA) column (4 
x 250 mm) eluted with 14 mM NaOH. The monosaccharides were 
detected by pulsed amperometric detection (Dionex PAD- 
detector) . The amount of monosaccharides was corrected to zero 
time of hydrolysis and calculated as nmol of monosaccharide per 

20 nmol of peptide. 

Mass spectrometry analysis was performed using an API III 
LC/MS/MS system (Sciex, Thornhill, Ontario, Canada). The triple 
quadrupole instrument has a mass-to-charge (m/z) range of 2400 
and is fitted with a pneumatically assisted electrospray (also 

25 referred to as ion-spray) interface [23,24]. Sample 
introduction was done by a syringe infusion pump (Sage 
Instruments, Cambridge, MA) through a fused capillary (75 fim 
i.d.) with a liquid flow-rate set at 0.5-1 /zl/min. The 
instrument m/z scale was calibrated with the singly-charged am- 

30 monium adduct ions of poly (propylene glycols) (PPG's) under unit 
resolution. 

The accuracy of mass measurements was generally better than 
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RESULTS 

Expression and purification 

DNA fragments encoding the two trefoil domains of HSP were 
isolated by PCR from human genomic DNA using primers based on 
5 the published cDNA sequence [8]. The full length HSP gene was 
obtained from the PCR cloned fragments by addition of synthetic 
DNA fragments. The HSP gene was fused in- frame to a hybrid 
yeast leader sequence by overlay extension PCR [31] (Fig, 3). 
The hybrid leader is based on the mouse salivary amylase signal 
10 peptide [34] and the S . kluweri a mating factor leader 
sequence [35] and is further modified near the Kex 2 cleavage 
site for efficient processing [36, 41]. 

The yeast expression plasmid pKFN-1847 contains the leader-HSP 
gene inserted between the S. cerevisiae triose phosphate 
15 isomerase promoter and terminator [37]. The expression vector 
(Fig. 4) also contains the Schizosaccharomvces pombe TPI gene 
(POT) [38]. 

The plasmid was transformed into the yeast strain MT-663, 
carrying a deletion in the TPI gene, by selecting for growth on 
20 glucose. 

The expression level of r-HSP in the present yeast system is 
approx. 12 0 mg/1. As can be seen from Fig. 5, the yeast su- 
pernatant contains two forms of r-HSP; one eluting at R t = 15.6 
min. and one eluting at R t = 16.1 min. These two forms were 
25 purified separately, and by using the analytical HPLC-system 
(Fig. 5), these two forms can be quantified individually during 
the different steps of the purification. 

After the initial concentration of the yeast supernatant by 
ultrafiltration, the first purification step was cationic 
3 0 exchange chromatography on a Fast Flow S column. Fig. 6 shows 
the elution profile from the column including the amount of r- 



WO 94/17102 



itrr/DKya/uuiMV 



19 

HSP and glycosylated r-HSP determined in the fractions. A 
complete separation of the two forms of r-HSP was obtained in 
this step. 

The fractions from the Fast Flows S column were pooled as 
5 indicated in Fig- 6, and the two peptides were further purified 
by preparative HPLC (Fig. 7) . The r-HSP and glycosylated r-HSP 
were recovered from the fractions indicated in Fig. 7A and Fig. 
7B by vacuum centrifugation and lyophilization. The 
purification is summarized in Table l. The overall yield of r- 
10 HSP and glycosylated r-HSP from 8 litres of fermentation broth 
was 160 mg and 219 mg corresponding to 50% and 34%, 
respectively. 

Characterization of r-HSP and glycosylated r-HSP 

Fig. 8 shows the purity of r-HSP and glycosylated r-HSP as 
15 analysed by analytical HPLC. From these results none of the 
peptides looks completely pure. However, upon rechroraatography 
of material eluting in the minor as well as the major peak, 
similar chromatograms were obtained for both peptides (results 
not shown) . This seems to indicate that the double peak 
20 observed for both r-HSP and glycosylated r-HSP reflects an 
atypical behaviour of these peptides on reverse phase columns 
rather than impurities in the preparations. 

Table 2 shows the amino acid seguencing results obtained on r- 
HSP and glycosylated r-HSP. The average repetitive yield was 

25 94.4% (r-HSP) and 94.6% (glycosylated r-HSP), respectively. In 
both cases the first 4 0 residues of the two peptides were 
confirmed by the sequence analysis. In the glycosylated HSP, no 
PTH-a.a. was found in Edman degradation cycle No. 15. The HSP 
sequence from residue 15-17 (Asn-Arg-Thr) corresponds to a 

30 consensus sequence for N-glycosylation of Asn-15. 

The carbohydrate composition analysis of glysocylated r-HSP 
showed the presence of 12.8 nmol mannose (Man) and 1.6 nmdl of 
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N-acetyl glucoseamine (GlcNAc) per ninol of r-HSP. By peptide 
mapping of r-HSP and glycosylated r-HSP in combination with 
mass spectrometry and sequencing analysis (results not shown) , 
no other residue besides Asn-15 of the glycosylated r-HSP was 
5 found to be modified, i.e. no O-glycosylation was found. 

In Fig. 9, the electro-spray mass spectrometry (ESMS) analysis 
is shown for r-HSP and glycosylated r-HSP. Fig. 9A and 9C are 
original mass spectra displaying characteristics series of 
multiply charged protonated ions always observed in ESMS 

10 spectra of proteins. Fig. 9B and 9D are the corresponding 
computer reconstructed mass spectra from which the molecular 
weight. of individual components may be read directly. As can be 
seen from Fig. 9B, the MW found for r-HSP is 11961.5 ±2 which 
is in very good agreement with a calculated mass of 11961.3. 

15 Fig. 9D shows the reconstructed ion spray mass spectrum of the 
glycosylated r-HSP. From the sequence analysis and the 
carbohydrate composition analysis, it is known that only Asn-15 
is glycosylated and that only two monosaccharide residues, 
mannose and N-acetyl glucoseamine, occur in the glycosylated 

20 form of r-HSP. From these results in combination with the mass 
spectrometry data, it is possible to deduce the different 
glycosylated forms of r-HSP (Table 3). 

Molecular weights corresponding to two series of carbohydrate 
side chains can be deduced from the combination of carbohydrate 

25 composition data and ISMS-data, namely (GlcNAc) 2 (Hex) 10 . 15 and 
(Hex) 13 . 17 (Table 3) . As mannose is the only hexose in the glyco- 
sylated r-HSP, and as Asn-15 is the only glycosylated residue, 
it seems reasonable to conclude that the structure of the 
glycosylation site is Asn- (GlcNAc) 2 - (Man) 1(M5 . The observed Asn- 

30 (Hex) 13 . 17 forms are thus most likely to arise from fragmentation 
in the mass spectrometer, by which the two GlcNAc residues lose 
an acetyl group and are converted into two hexoses. 
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The structure of Asn- (GlcNAc) 2 - (Man) 10 . 15 has previously been 
reported as high mannose type of N-glycosylation for other 
peptides and proteins expressed in yeast [26]. 

Table 1 

5 Purification of r-hSP and glycosylated r-hSP from y ast 
supernatant 



STEP 


Volume 
[ml] 


Amount [mg] 


Yield [%] 


r-hSP 


glycosylated 
r-hSP 


r-hSP 


glycosylated 
r-hSP 


Yeast supernatant 


8000 


320 


640 


100 


100 


Ultrafiltration 


900 


207 


405 


65 


63 


Ion exchange Pool 1 
chromatography 

Pool 2 


160 
220 


182 


275 


57 


43 


Prep HPLC Pool 1 


54 




219 




34 


Pool 2 


80 


160 




50 
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Table 2 

Amino acid sequence analysis of r-hSP and glycosylated r-hSP 







Yield (praol) 


uycic ino. 




_ LCD 

r-nor 


glycosylated r- 
hSP 


l 


1_71U 


4304 


8853 


z 


I 1W 

i*ys 


6925 


8292 


-a 
j 


Pro 


6027 


12837 


A 

4 




2890 


5602 


c 

J 


rio 


4336 


8802 


O 




ND 


ND 


/ 




3388 


5689 


o 
o 


(Cys) 


ND 


ND 


9 


ser 


1279 


2417 


1U 


Arg 


1876 


2523 


11 


Leu 


2277 


4290 


12 


2>cr 


877 


1790 


13 


rto 


1545 


2963 


lH 


nis 


517 


574 




Asn 


1202 


0* 


1 A 


Am 


959 


1471 


1*7 • 
1 / 


i nr 


978 


2172 


IK 
AO 


Acn 


1066 


1509 


1Q 


^ysj 


ND 


ND 






836 


1857 


91 


Phe 


993 


1958 


77 


Pro 


843 


1839 




Glv 


785 


2049 


24 


IlC 


640 


1400 


75 


Thr 


589 


1454 


26 


Ser 


274 


621 


27 


Asp 


581 


1391 


28 


Gin 


445 


952 


29 


(Cys) 


ND 


ND 


30 


Phe 


623 


1562 


31 


Asp 


483 


1210 


32 


Asn 


369 


823 


33 


Gly 


359 


885 


34 


(Cys) 


ND 


ND 


35 


(Cys) 


ND 


ND 


36 


Phe 


422 


1094 


37 


Asp 


268 


783 


38 


Ser 


127 


324 


39 


Ser 


145 


394 


40 


Val 


298 


827 


ND: Not determined 



*: No trace of PTH-Asn or PTH-Asp was seen in cycle No. 15 of glycosylated 
r-hSP. 
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Structure Calculated MW found by 









MW 


ESMS (Fig.9D) 


hSP 


+ 


2 GlcNAc +10 Man 


13989.1 


13989.5 


hSP 


+ 


2 GlcNAc +11 Man 


14151.2 


14151.0 


hSP 


+ 


2 GlcNAc + 12 Man 


14313.4 


14313.5 


hSP 


+ 


2 GlcNAc + 13 Man 


14475.5 


14475.0 


hSP 


+ 


2 GlcNAc +14 Man 


14639.7 


14639.5 


hSP 


+ 


2 GlcNAc + 15 Man 


14799.8 


14801.5 



hSP + 13 Man 


14069.1 


14072.0 


hSP + 14 Man 


14231.3 


14232.5 


hSP + 15 Man 


14393.4 


14393.0 


hSP + 16 Man 


14555.5 


14557.5 


hSP + 17 Man 


14717.7 


14720.0 
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